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REMARKS 

Claims 1 and 3-48 were examined by the Office, and claims 1, 3-38 and 40-48 are 
rejected, claim 39 has not been properly rejected in this office action. The Office has committed 
clear error by failing to provide any grounds for rejecting claim 39, and by failing to show that 
each and every element recited in the claims of the present application are disclosed or suggested 
by the cited references. See MPEP § 2131. For at least these reasons, applicant respectfully 
requests reconsideration and withdrawal of the rejections to the claims. 

This Pre- Appeal Brief Request for Review is submitted along with a Notice of Appeal. 

Claim Rejections Under § 102 

In section 3, on page 5 of the Office Action, claims 1, 3-14, 19-38 and 40-48 are rejected 
under 35 U.S.C. 102(b) as being anticipated by Gersho et al. (U.S. Patent No. 6,31 1,154). 
Applicant respectfully submits that claim 1 is not disclosed or suggested by Gersho, because 
Gersho at least fails to disclose or suggest segmenting an audio signal into a plurality of 
segments based on the audio characteristics of the audio signal, and a parametric-type encoding 
method, as recited in claim 1 . 

The Office has committed clear error in rejecting claim 1 based on Gersho, because 
Gersho does not disclose or suggest segmenting an audio signal into a plurality of segments 
based on the audio characteristics of the audio signal. In contrast to claim 1, Gersho discloses 
that the speech waveform is partitioned into a sequence of successive frames, each frame has a 
fixed length, and each frame is then partitioned into a number of equal length subframes. See 
Gersho column 1, lines 55-58; see also column 7, lines 23-26 (each basic frame is partitioned 
into M equal length subframes). Gersho cannot disclose segmenting based on the audio 
characteristics as recited in claim 1, because the speech waveform is partitioned into fixed length 
frames regardless of the audio characteristics. The method for coding a speech signal disclosed 
by Gersho includes first partitioning samples of a speech signal into frames, and then classifying 
the speech signal in each of the frames into one of a plurality of classes. See Gersho column 4, 
lines 25-27. The method discussed in Gersho cannot classify speech samples before partitioning, 
because the frames that are classified into a class, i.e. the frames must first exist before they can 
be classified. In fact the Office acknowledges on page 2 of the Office Action, that when the 
energy peak is determined, the speech signal is already partitioned into frames. Therefore, the 
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characteristics of the speech signal are not determined until after the speech signal has been 
partitioned into frames. In contrast to claim 1, Gersho determines where to set the boundary of 
each frame when partitioning the samples without any regard for the audio characteristics of the 
speech signal in the frames. Since frames are not classified until after partitioning, and it is 
impossible to partition the speech signal based on classes before classifying the speech signal, 
Gersho cannot disclose or suggest segmenting the audio signal into segments based on the audio 
characteristics of the audio signal, as recited in claim 1 . In contrast, claim 1 recites that audio 
signals are segmented based on the audio characteristics in the audio signals. In claims 1 the 
segmenting depends on the audio characteristics of the audio signal. Because the audio 
characteristics of the audio signal may vary from sample to sample, the boundary of the 
segments is not pre-determined. As a result, a segment can be long or short; it can be 10 frames 
or 28 frames (see e.g. Figure 3). In Gersho, the length of each partitioned "segment" is the same. 
See Gersho column 7, lines 23-26. 

Furthermore, contrary to the assertions of the Office, partitioning the speech into frames 
and sub-frames, and enhancing performance by coding the important segments of the excitation 
more accurately is not the equivalent of segmenting based on audio characteristics, as recited in 
claim 1. Gersho only discloses that the frames of the speech signal is encoded based on classes, 
which are based on the nature and amount of information contained in the frames. See Gersho 
column 3, lines 57-61. In order to enhance the coding efficiency, Gersho discusses coding the 
excitation signal in windows depending on the classification of the speech frames. Gersho also 
discloses dividing a fixed frame into a number of subframes for the purpose of locating the active 
periods (i.e., windows) of the excitation signal in the subframes. However, Gersho does not 
disclose or suggest segmenting each fixed frame into a plurality of subframes based on the audio 
characteristics of the audio signal in the fixed frame. Instead, Gersho only discloses coding the 
excitation in the subframes depending on the audio characteristics of the fixed frame, and then 
classifying the speech signal in each of the fixed frames into different classes using two 
classifiers. See Gersho column 4, lines 51-55. Classification is only carried out by a classifier 
after the speech is partitioned into frames and subframes. After classification, frames belonging 
to a category are coded by a coding method that represents the excitation in those categories. 

Therefore, Gersho does not disclose or suggest segmenting the audio signal into a 
plurality of segments based on the audio characteristics of the audio signal. For at least this 
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reason, Gersho fails to disclose or suggest all of the limitations recited in claim 1, and the Office 
has committed clear error in rejecting claim 1. 

Furthermore, Gersho also fails to disclose or suggest parametric audio coding as recited 
in claim 1, because Gersho is concerned with a CELP-type encoding method. In the parametric- 
type encoding method recited in claim 1 , a parametric speech production model is used to obtain 
a set of parameters from the audio signal so as to produce a further audio signal in the decoder 
based on the parameters. The parametric-type encoding and decoding method, for example as 
discussed in the specification of the present application, does not rely on the waveform of the 
speech signal segments. In fact, due to the loss of the synchrony between the coder input and 
output signal, waveform matching is not carried out. 

A CELP coder is an example of an Analysis-by-Synthesis (AbS) coder. See Gersho 
column 1 , line 54 — column 2, line 1 . As known in the art, a CELP coder performs waveform 
matching on the coder output using code excitation candidates and selecting the one minimizing 
given error criteria. As disclosed in Gersho, the CELP coder relies on the residual and excitation 
models. Gersho's coder is not a parametric coder as disclosed in the present invention. 

Therefore, for the reasons discussed above, independent claims 1,19, 22, 26, 27, 31 and 
32 are not disclosed or suggested by Gersho. 

Claims 3-14, 20, 21, 23-25, 28-30 and 33-48, they are dependent from claims 1, 19, 22, 
26, 27 and 31, recite features not recited in claims 1,19, 22, 26, 27 and 31, and therefore are 
patentable at least in view of their dependencies. 

Claim Rejections Under $ 103 

In section 4, on page 15 of the final Office Action, claims 15-18 are rejected under 35 
U.S.C. 103(a) as being unpatentable over Gersho, in view of Gersho (IEEE-96). 

It is respectfully submitted that claims 15-18 are dependent from claims 1, recite features 
not recited in claim 1, and therefore are patentable over the cited references at least in view of 
their dependencies. 



Claim 39 

Applicant respectfully submits that the Office has committed clear error by completely 
failing to provide any grounds for rejecting claim 39, or addressing claim 39 in any manner. 
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Claim 39 is not addressing in either the § 102 or § 103 claim rejections. Therefore, for at least 
this additional reason, applicant respectfully requests withdrawal of the final Office Action. 

Conclusion 

For at least the reasons discussed above, applicant respectfully submits that the Office has 
committed clear error in rejecting the claims of the present application, as such the present 
application is believed to be in condition for allowance, and such action is earnestly solicited. 
The undersigned hereby authorizes the Commissioner to charge Deposit Account No. 23-0442 
for any fee deficiency required to submit this response. 

Respectfully submitted, 
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